Automatic selection of preprocessing methods for improving predictions on mass spectrometry protein profiles.

نویسندگان

  • Richard C Pelikan
  • Milos Hauskrecht
چکیده

Mass spectrometry proteomic profiling has potential to be a useful clinical screening tool. One obstacle is providing a standardized method for preprocessing the noisy raw data. We have developed a system for automatically determining a set of preprocessing methods among several candidates. Our system's automated nature relieves the analyst of the need to be knowledgeable about which methods to use on any given dataset. Each stage of preprocessing is approached with many competing methods. We introduce metrics which are used to balance each method's attempts to correct noise versus preserving valuable discriminative information. We demonstrate the benefit of our preprocessing system on several SELDI and MALDI mass spectrometry datasets. Downstream classification is improved when using our system to preprocess the data.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Efficient Framework for Accurate Arterial Input Selection in DSC-MRI of Glioma Brain Tumors

Introduction: Automatic arterial input function (AIF) selection has an essential role in quantification of cerebral perfusion parameters. The purpose of this study is to develop an optimal automatic method for AIF determination in dynamic susceptibility contrast magnetic resonance imaging (DSC-MRI) of glioma brain tumors by using a new preprocessing method.Material and Methods: For this study, ...

متن کامل

Preprocessing of tandem mass spectrometric data to support automatic protein identification.

Liquid chromatography tandem mass spectrometry is a major tool for identifying proteins. The fragment spectra of peptides can be interpreted automatically in conjunction with a sequence database search. With the development of powerful automatic search engines, research now focuses on optimizing the result returned from database searches. We present a series of preprocessing steps for fragment ...

متن کامل

Protein profiling and analysis of drug sensitive and multidrug resistant isolates of Mycobacterium tuberculosis by native polyacrylamide gel electrophoresis and mass spectrometry

Introduction: Tuberculosis (TB) remains a deadly infectious disease despite all the efforts to reduce its incidence. Spread of multidrug resistant TB has seriously undermined the efforts to control the disease globally. In this study protein expression profile of MDR and sensitive isolates of MTB were analyzed and compared in order to identify proteins, which could be used in prevention, diagno...

متن کامل

A New Approach for the Analysis of Mass Spectrometry Data for Biomarker Discovery

In the last few years a growing interest has been devoted to disease diagnosis based on proteomic profiles of body fluids generated by mass spectrometry. In this work, we will present a new approach for their analysis for biomarker discovery. In particular, we will describe a new strategy for the analysis of SELDI/MALDI-TOF serum data based on the following three steps: i) data-preprocessing, i...

متن کامل

A New Hybrid Feature Subset Selection Algorithm for the Analysis of Ovarian Cancer Data Using Laser Mass Spectrum

Introduction: Amajor problem in the treatment of cancer is the lack of an appropriate method for the early diagnosis of the disease. The chemical reaction within an organ may be reflected in the form of proteomic patterns in the serum, sputum, or urine. Laser mass spectrometry is a valuable tool for extracting the proteomic patterns from biological samples. A major challenge in extracting such ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • AMIA ... Annual Symposium proceedings. AMIA Symposium

دوره 2010  شماره 

صفحات  -

تاریخ انتشار 2010